The objective of this research was to investigate the
feasibility of introducing computer vision methods to the
construction site. The primary use envisioned is
direct input of data from a video medium to the computer for
the purpose of productivity analysis.

The first step described is a familiarization with a 
fundamental computer vision system. 

Then, the results of processing actual footage from
construction sites and other, existing buildings are
described. Observations are made as to the effect of
the structure, lighting conditions and
shadowing, and physical obstructions. These concepts are
illustrated by digitized images of the structures observed.

Proposals are then made as to methods to ensure precise,
repeatable placement of observation cameras. The alternate
proposal is translation of images obtained from different
camera positions through the use of on-screen reference
points.

Finally, a summary of the physical barriers and
technological problems, and suggested courses of action, is
provided.
CHAPTER 1
INTRODUCTION AND BACKGROUND
Traditional Data Acquisition
on the Construction Site

Productivity can be defined as the ratio of input
units to output units (input/output). In
construction, this can be specified as
manhours-, machine hours-, or dollars-per-ton, -lineal foot,
-square foot, or -cubic yard.

Productivity on the construction site is currently
determined by several means. The simplest, yet least useful
for productivity improvement programs, is after-the-fact
comparison of total resources expended versus quantities in
place. Several more useful methods involve data collection,
on the job site, of work in progress. This data could be in
the form of: time-lapse photography; work sampling; or
manual timing of operations.

The method of time-lapse photography involves the viewing
of video footage (or film) of construction operations.
Observations are then made as to inefficiencies in procedures,
wasted travel time to locations of work and materials,
and the actual production rate of work in place.


Work sampling involves random surveys of the job site
to observe whether "effective work" is underway. Effective
work is defined as those activities that either actually
place materials in the structure or directly support such 
activities. Ineffective work is everything else (standing 
around, drinking a soda, or transporting materials long 
distances). By gathering sufficient data on all activities, 
a picture of the effectiveness of the work force can be 
drawn. 
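
As a hedged illustration of how such survey tallies might be reduced, the sketch below estimates the proportion of effective work and a normal-approximation confidence half-width. The function name, the sample counts, and the 95 percent z-value are assumptions made for illustration; none of them is taken from the study.

```python
import math


def work_sampling_estimate(effective, total, z=1.96):
    """Estimate the share of effective work from random observations.

    effective: number of observations classed as effective work
    total:     total number of observations
    z:         normal deviate for the confidence level (1.96 ~ 95 percent,
               an assumed default)
    Returns (proportion, half_width) of the confidence interval.
    """
    if total <= 0 or not 0 <= effective <= total:
        raise ValueError("counts must satisfy 0 <= effective <= total, total > 0")
    proportion = effective / total
    # Standard error of a binomial proportion (normal approximation).
    half_width = z * math.sqrt(proportion * (1.0 - proportion) / total)
    return proportion, half_width


if __name__ == "__main__":
    p, hw = work_sampling_estimate(60, 100)
    print(f"effective work: {p:.0%} +/- {hw:.1%}")
```

With 60 effective observations out of 100, the estimate is 60 percent effective work, give or take roughly ten percentage points; more observations narrow the interval.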

Manual timing of operations involves observers on the 
job site with stopwatches and notepads recording actual 
performance times of the activities of concern. This method 
is extremely manpower intensive. 

These productivity analysis methods all involve some 
statistical analysis of the job. Statistical sampling is 
necessary due to the large expenditure of time and capital 
involved in continuous observation. This large expenditure 
occurs not only during the collection of the data, but also 
during the required analysis. 

The collection of data is the first of four steps in 
the statistical sampling process. The remaining steps are: 
(2) organization of data, (3) analysis of data (determina- 
tion of "descriptive statistics"), and (4) interpretation of 
data ("inferential statistics") (Dillman 1981, pp. 7-8). 
Theoretically, proper evaluation of a statistical sample 
provides reasonably accurate results. 

The final three steps listed above are well suited for
accomplishment by computer. However, manual data collection
and manual input to the computer fail to utilize the full
potential of the technologies being developed in the
manufacturing industry.


Historical Development/System Trends


"Pattern recognition" can be defined as the computerized
process of analyzing a sensory image and differentiating
and recognizing the component parts of that image.
This concept is widely used in the factory environment for
quality control and for the control of robotics.

The current state-of-the-art pattern recognition system 
requires that very distinct patterns be provided. In the 
factory this is accomplished by separating individual items 
and closely scrutinizing them under optimal lighting condi- 
tions. This is impractical on the construction site, where 
lighting is largely uncontrollable and the distances to the 
object are greater; but then the requirements at this stage 
are not to discern small individual components but to survey 
bulk quantities of production, e.g. square feet of wall, 
feet of pavement or pipe, etc. These differences in objec- 
tives and physical characteristics necessitate a different 
class of optics and recognition systems than those used in
the factory environment.





Objectives of Study 


The objective of this research was to investigate the
feasibility of introducing computer vision methods to the
construction site. This objective was identified
as a primary consideration for introducing automated data
acquisition for productivity analysis of the construction
process (Thomas and Smith 1987).


The tasks necessary to fulfill this objective are:

1. Learn the capabilities of an entry-level computer
   vision system with regard to its ability to
   reconstruct a given image with varying levels of
   contrast.

2. Select sample structure(s) for data collection.
   Determine the feasible angles of view for
   observation cameras, and their related distances,
   based upon the nature of the object observed and
   the specifications of the camera system.

3. List the steps involved in correlating the
   dimensions of the visual image processed by the
   computer vision system with the actual dimensions
   of the observed object, and the corresponding
   areas.

4. Identify the physical and technological barriers to
   effective use of entry-level vision system
   hardware in construction.





Methods 

The first requirement was an in-depth literature search
and review. Publications on computer vision and construction
management were reviewed for information on previous
work on vision systems employed for productivity data
collection. In addition to published literature, course
notes from the Computer Vision and Inspection course offered
at the Pennsylvania State University were reviewed as a
primer on vision system techniques and applications.

In conjunction with the literature search, the labora- 
tory exercises for the aforementioned computer vision course 
were performed to familiarize the author with the equipment. 

Field data collection was performed using a Panasonic
video camera and recorder. Time of day, temperature,
weather, and location were all considered prior to collection
of data. The following structures were considered as
potential subjects: Mid-State Bank (under construction);
Atherton Hotel (under construction); and Centre Community
Hospital (not under construction). Upon selection of Centre
Community Hospital for analysis, as-built drawings were
obtained for additional information.

Analysis consisted of executing pixel counts on the
images collected. These images were then qualitatively
evaluated for indications of where system improvements are
required, and where potential technological limitations
exist.

The findings of the study enabled the author to determine
that, although the concept is feasible, substantial work
remains prior to prototype applications.





CHAPTER 2


HARDWARE AND SOFTWARE 


Determination of PCEYE
System Capabilities

The computer vision system utilized for this study is 
PCEYE, marketed by Chorus Data Systems. Its unit cost is 
under $1000, exclusive of the computer and camera equipment. 
The PCEYE system used for this study, as available in the 
Computer Vision and Inspection Laboratory (CVIL) of the 
Mechanical Engineering Department of The Pennsylvania State 
University, includes the following components: 

-IBM Personal Computer with 256K RAM, dual 360K drives 

-Color Monitor, using Color Graphics Adapter 

-PCEYE System Board 

-Black and White Video Camera 

-Black and White Video Monitor 

-Video Cassette Recorder (VCR) 

-Epson Dot Matrix Printer

All laboratory work (e.g. image processing) was accomplished
in CVIL using this equipment.


Input Characteristics

The PCEYE system accepts as input the standard analog
RS-170 signal, in common use for black and white video
transmission, or the National Television System Committee
(NTSC) signal, which carries the information for color
images in addition to the RS-170 data. The NTSC signal is
common to television and VCR transmissions. The RS-170
signal is illustrated in Figure 2.1. This signal causes the
scanning beam in a television picture tube to scan across
the tube in consecutive horizontal lines from top to bottom.
It does this thirty times each second.

This continuous function (signal) is translated into a 
discrete (digitized) function by an analog-to-digital 
converter on the PCEYE system board. 

The PCEYE system is also capable of retrieving images 
which were previously saved to disk. These disk-saved 
images are currently what must be used for analysis and 
comparison. 

The laboratory setup used for course work typically 
feeds the signal directly from the black and white camera 
through the video monitor to the PCEYE board. For the pur- 
poses of this study, it was necessary, and in fact desir- 
able, to record the images on videotape at the building 
site, and replay them on a VCR in the laboratory for input 
to the system. This was desirable because it more precisely
matches the probable procedure that will be used in
construction practice.


Resolution 


Picture element size (crispness). The PCEYE system, as configured,
breaks the image into 320 picture elements (pixels)
horizontally by 200 pixels vertically. Using the
example of a building 100 feet tall which fills the screen
vertically, each vertical pixel then equates to six inches.

RS-170 SIGNAL (Carlson and Gisser)
FIGURE 2.1
[Timing diagram; labeled intervals include 63.5 us per horizontal line.]

The resolution of the 320 by 200 system is the minimum 
recommended for use. Figure 2.2(a) is the upper left 64 by 
64 image extracted from the PCEYE image in Figure 2.2(b). 
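
The six-inch figure above follows from simple division. The sketch below restates that arithmetic; the function name and its arguments are illustrative only, and it assumes the object exactly fills the stated number of pixels.

```python
def feet_per_pixel(object_extent_ft, pixels_spanned):
    """Real-world distance represented by one pixel along one axis.

    Assumes the object spans exactly `pixels_spanned` pixels on screen.
    """
    if pixels_spanned <= 0:
        raise ValueError("pixels_spanned must be positive")
    return object_extent_ft / pixels_spanned


if __name__ == "__main__":
    # A 100-foot-tall building filling all 200 vertical pixels.
    scale_ft = feet_per_pixel(100.0, 200)
    print(f"{scale_ft * 12:.1f} inches per vertical pixel")
```

Any feature smaller than this per-pixel distance cannot be resolved, which is why the 320 by 200 format is described as a minimum.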

Intensity. The existing system uses four grey levels
to represent a digitized image. These four grey levels, or
intensities, are displayed as four pseudo colors on
the color monitor. In ascending order they are: black,
blue (cyan), red (magenta), and white.

The user can adjust the sensitivity of the digitizer
until the image observed meets the desired intensity levels;
the resultant image could be black and white, or four color,
whichever brings out the desired details. The sensitivity
is actually assigned by specifying the lowest intensity that
is interpreted as white, and the highest intensity that is
interpreted as black. The system then distributes the other
two grey levels between these two limits.
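
A minimal sketch of this thresholding scheme follows. It assumes, purely for illustration, that intensities are numbers on an arbitrary scale and that the two intermediate levels split the interval between the limits evenly; the source does not state how PCEYE actually spaces them, and the function and parameter names are hypothetical.

```python
def quantize(intensity, black_max, white_min):
    """Map an intensity to one of four grey levels (0 = black .. 3 = white).

    black_max: highest intensity still interpreted as black
    white_min: lowest intensity interpreted as white
    The even split of the middle band is an assumption, not PCEYE's
    documented behavior.
    """
    if black_max >= white_min:
        raise ValueError("black_max must be below white_min")
    if intensity <= black_max:
        return 0
    if intensity >= white_min:
        return 3
    midpoint = (black_max + white_min) / 2.0
    return 1 if intensity < midpoint else 2


if __name__ == "__main__":
    for value in (10, 100, 150, 250):
        print(value, "->", quantize(value, black_max=50, white_min=200))
```

Moving the two limits closer together pushes more of the scene into pure black or pure white, which is the black and white extreme described above.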


Image Processing 

Color versus Black and White. There is usually no
problem inputting a color signal to a simple frame grabber
such as PCEYE, since the color signal usually contains the
same information in the same format as a black and white
signal. Any problem that arises stems from the limitation
that such a frame grabber is designed solely to
handle black and white images. The extraneous color
information can be misinterpreted, causing interference
and/or a blurring of the image; therefore the color portion
of the signal is normally filtered out. Systems that make
use of the color signal are under development. The initial
development of color systems is likely to encounter, to a
greater degree, what was found in grey level systems: the
more grey levels there are, the more complex and slow the
software becomes. The distinct advantage is that the
system becomes more tolerant of variations in lighting, thus
requiring less special lighting. For example, backlighting
is usually required for a black and white, edge detecting
system.

For the system utilized in this study, the color of the 
object only plays a role in the development of an image 
inasmuch as the color can create a differentiation in the 
intensity of the light reflected by that object. 

Volume of individual grey levels. The PCEYE system
package contains two programs which process the digitized 
image and report the number of pixels at each grey level. 
The images processed by these programs must have been 
previously saved to disk in a format different from the 
standard PCEYE format (see CAPTURE.BAS below). 

The first program processes the entire image (frame),
which takes two to three minutes. The product is a histogram
showing graphically the number of pixels per grey
level.
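
The whole-frame count can be sketched as follows. This is not the PCEYE program itself; it assumes the image is held as a list of rows of integer grey levels (0 to 3), and the function name is hypothetical.

```python
def grey_level_histogram(image, levels=4):
    """Count the pixels at each grey level in a whole frame.

    image:  list of rows, each a list of integer grey levels
    levels: number of grey levels (four for the system described)
    Returns a list where index i holds the count for grey level i.
    """
    counts = [0] * levels
    for row in image:
        for pixel in row:
            if not 0 <= pixel < levels:
                raise ValueError(f"grey level {pixel} out of range")
            counts[pixel] += 1
    return counts


if __name__ == "__main__":
    frame = [
        [0, 1, 1],
        [3, 3, 3],
    ]
    print(grey_level_histogram(frame))
```

For a 320 by 200 frame this touches 64,000 pixels, so the counts always sum to 64,000 regardless of scene content.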

The second program will process any rectangular portion 
of the image designated by the user, up to the full screen. 
This can take over an hour, but more detailed information is 
provided. The user designates what "colors" are part of 
the object, and is told the size of the object, its centroid, 
and if there are any "holes" in it (visually). This program 
was developed at the Pennsylvania State University and is 
not available with PCEYE. 
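
The region analysis can be approximated by the sketch below, which reports object size and centroid for a user-designated rectangle. It is an illustrative reconstruction, not the Penn State program: the function name, the inclusive corner convention, and the list-of-rows image format are assumptions, and hole detection is omitted.

```python
def analyze_region(image, top, left, bottom, right, object_levels):
    """Count object pixels and locate their centroid inside a rectangle.

    image:         list of rows of integer grey levels
    top, left:     upper left corner (row, column), inclusive
    bottom, right: lower right corner (row, column), inclusive
    object_levels: set of grey levels the user treats as the object
    Returns (pixel_count, centroid), where centroid is (row, column)
    or None when no object pixels are found.
    """
    count = 0
    row_sum = 0
    col_sum = 0
    for r in range(top, bottom + 1):
        for c in range(left, right + 1):
            if image[r][c] in object_levels:
                count += 1
                row_sum += r
                col_sum += c
    if count == 0:
        return 0, None
    return count, (row_sum / count, col_sum / count)


if __name__ == "__main__":
    frame = [
        [0, 0, 0, 0],
        [0, 3, 3, 0],
        [0, 3, 3, 0],
        [0, 0, 0, 0],
    ]
    print(analyze_region(frame, 0, 0, 3, 3, {3}))
```

Restricting the rectangle to the object of interest keeps background pixels that happen to share a grey level out of the count.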

Image Enhancement. Using this system, the only prac- 
tical method of enhancing the image is proper adjustment of 
the black and white levels. These levels determine the 
upper limit of light intensity that will be interpreted as 
black and the lower limit interpreted as white. The user 
sets the limits by either entering the levels into the
parameter table available in the PCEYE program, or by
adjusting the image as it appears on the screen.


Output

By definition, the hardware has one output: digitized 
images. The software processes these images, which can be 
viewed on a monitor, printed on paper, or saved on disk. 
The value of the system lies in the various pixel counting
and image comparison programs.
PCEYEIO is a PCEYE program (in BASIC) whose main
function is to digitize the analog signal, then display and/or
save the resultant digitized images. DIFFPGM, SHOWPIC, and
PRNPICS use the images saved by PCEYEIO.

DIFFPGM.EXE compares two PCEYE images pixel by pixel.
It then reports the number and percentage of pixels that
do not match. The sensitivity of the program is set by the
user during each run, meaning that the user may choose the
range of grey levels that can cancel each other. For
example, choosing zero for the sensitivity means only exact
matches cancel one another (blue cancels only blue). A
sensitivity of one means pixels within one level of each
other will cancel (blue could now also cancel black and
red). Zero should be the only feasible setting
for this system, since there can be a great difference
between the extremes of light intensity of adjacent grey
levels in this system. Figure 2.3 illustrates the relative
error of a four grey level system to a sixteen grey level
system when the sensitivity is set to one.
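
The comparison logic described for DIFFPGM can be sketched as below. This is an illustrative reimplementation under stated assumptions, not the DIFFPGM.EXE source: images are lists of rows of integer grey levels, both images have the same dimensions, and the function name is hypothetical.

```python
def compare_images(first, second, sensitivity=0):
    """Compare two digitized images pixel by pixel.

    Two pixels "cancel" when their grey levels differ by no more than
    `sensitivity`. Returns (mismatch_count, mismatch_percent).
    """
    if len(first) != len(second):
        raise ValueError("images must have the same number of rows")
    total = 0
    mismatched = 0
    for row_a, row_b in zip(first, second):
        if len(row_a) != len(row_b):
            raise ValueError("rows must have the same length")
        for a, b in zip(row_a, row_b):
            total += 1
            if abs(a - b) > sensitivity:
                mismatched += 1
    percent = 100.0 * mismatched / total if total else 0.0
    return mismatched, percent


if __name__ == "__main__":
    image_a = [[0, 1], [2, 3]]
    image_b = [[0, 2], [2, 0]]
    print(compare_images(image_a, image_b, sensitivity=0))
    print(compare_images(image_a, image_b, sensitivity=1))
```

With only four grey levels, a sensitivity of one lets a pixel cancel against half or more of the available levels, which is why zero is the only practical setting on this system.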

This assumption of the required sensitivity was tested.
From testing, it was found that using
zero resulted in a fairly constant mismatch of approximately
two percent of the pixels in any two images: 1) grabbed
close together in the laboratory; 2) under artificial
lights; and 3) directly from the camera. This error is
small and consistent, and therefore of little concern. Then
the step of recording these same images onto videotape was
introduced. When the images were replayed to the system,
the standard error increased to approximately 3.8 percent.
Again, though the error nearly doubled, it is relatively
minor and acceptable.

When images obtained in the field were subjected to the
same test, the errors were in the fifteen percent range.
This error is no longer acceptable; it is evident that the
difference is the result of the uncontrolled characteristics
of the natural lighting and is therefore not easily predictable.
The change in lighting appears to vary beyond the
bounds of one PCEYE grey level. A solution might be to use
one instead of zero for the sensitivity. As discussed
above, such a decrease in the sensitivity yields results
which can be too coarse for realistic analysis. This problem
should be resolvable upon the introduction of more grey levels,
where sensitivity can be more finely tuned.

This may be a moot point, as the pixel for pixel 
matching is effective only where the two images can be 
absolutely overlaid. The significance of this constraint 
will be discussed in Chapters 3 and 4. 

SHOWPIC.EXE and PRNPICS.EXE exist to display previously 
saved PCEYE images on the screen, and to print such images, 
respectively. They serve no direct analytical purpose; 
however, review of hard-copy images can bring insight to the
patterns assigned by the system. This is demonstrated in
Figure 2.4, where portions of the dark brown aluminum panels
present a pattern similar to the buff brick.

HISTO takes the complete screen image and produces a
histogram of the number of pixels in each grey level. When
counting the overall number of pixels per grey level in an
image, the need for exact alignment is relaxed, but camera
coverage still must be approximately the same. This is in
contrast to the exact overlay required when doing the point-by-point
comparison described above. In this case, individual
pixels may still have the fluctuation noted while using
DIFFPGM, but by comparing the whole field of pixels, the
variations of individual pixels should tend to balance.

MEDAOI, developed at the Pennsylvania State University, 
lists the number of pixels in each grey level of an 
image, or any rectangular portion thereof; the user defines 
the upper left and lower right corners. The capability of 
this program to take any portion of the image is a distinct 
advantage, as areas external to the object can be reduced,
but it is extraordinarily slow.

Video Cameras & Recording Media


There is a great range of equipment available on the 
market today for recording video images. The quality of 
images produced can usually be correlated to the expense of 
the system; therefore, it will be important to determine the
minimum image quality necessary for processing.

Camera Features: Advantages and Disadvantages

Tube Type (Vidicon) versus Solid State (CCD). There 
are two general video camera types that are available to the 
retail purchaser. These are illustrated in Figure 2.5. 

The first is the older, tube type camera, also known as
"vidicon." This conventional device captures the visual
image by passing an electron beam across a reactive screen.
This produces an analog signal which is combined with a
synchronizing signal within the camera, and is output as the
RS-170 (or NTSC) signal. This type of camera is currently
more prevalent due to its longer time on the market.

Although this is the type of camera used in this re- 
search (because it was available), it is not preferable for 
use on the construction site; the tube used in these cameras 
is fragile, susceptible to rough handling and environmental 
extremes, and is subject to "burn-in" (physical damage to 
the reactive screen) if directed at a light source, such as 
a window reflecting the sun or a welder at work. 

The second, newer type, the solid state camera, is also
known as a charge-coupled device or "CCD". This type of
camera also has a screen onto which the image is cast,
although the screen here is an array of "wells" on the surface
of a semiconductor chip. The array is electronically
scanned and produces a digital signal. Unfortunately, this
digital signal is then converted to analog within the camera
in order to match the current standard (RS-170) for recording
on standard videotape and for viewing on a standard
television or monitor. Also, as discussed above, this
convention has necessitated the use of an analog-to-digital
converter in any computer vision system.

The advantages of the solid state camera are its 
overall hardiness over the tube type, and its increasing
availability. The solid state components are not as suscep- 
tible as the tube to environmental conditions or burn-in. 
Manufacturers have almost phased out the tube type camera in 
favor of the solid state cameras for retail sales. 

Zoom lens capabilities. Use of a fixed focal length
lens greatly limits the flexibility of any data collection 
system in that it creates an extremely narrow range envelope 
from which the required data can be collected. A zoom lens 
can increase that envelope practically indefinitely. 

As an example of the flexibility that a zoom lens pro- 
vides, the camera used in this research has an eight-to-one 
zoom (10.5-84 mm), giving the user a large "envelope" within 
which to set up. In the case of a building 100 feet tall, 
for example, the usable distance for total coverage of the 
building ranges from about 150 feet to 1200 feet. 
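
The quoted envelope is consistent with a simple similar-triangles (pinhole) relation between focal length and subject distance. The sketch below illustrates that relation only; the function name is hypothetical, and the 7.0 mm image height is an assumed value chosen because it reproduces the distances quoted above, not a stated specification of the camera used.

```python
def camera_distance_ft(object_height_ft, focal_length_mm, image_height_mm):
    """Distance at which an object just fills the image vertically.

    Pinhole approximation: distance / object_height equals
    focal_length / image_height. Valid when the distance is much
    larger than the focal length.
    """
    if image_height_mm <= 0:
        raise ValueError("image_height_mm must be positive")
    return object_height_ft * focal_length_mm / image_height_mm


if __name__ == "__main__":
    ASSUMED_IMAGE_HEIGHT_MM = 7.0  # hypothetical; see lead-in
    for focal_mm in (10.5, 84.0):
        d = camera_distance_ft(100.0, focal_mm, ASSUMED_IMAGE_HEIGHT_MM)
        print(f"{focal_mm} mm lens setting: about {d:.0f} ft")
```

Because distance scales linearly with focal length, an eight-to-one zoom gives an eight-to-one range of workable camera distances for the same coverage.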

This topic will be discussed further in the treatment 
of camera placement. 

Automatic and manual light level adjustments. Most
current video cameras have the capability for automatic
light level control. Most of these allow the user to switch
to manual control. There are distinct advantages and
disadvantages to each setting.

The automatic light level adjustment feature allows the 
camera to adjust the iris to meet existing light levels in 
the environment. This allows the overall lighting levels to
change and produce no significant difference in the 
perceived image as captured by the camera. The major 
problem with the automatic adjustment occurs when there is a 
"hot spot" within the field of view, or even a bright object 
near the camera and not in the picture. The former could be 
a highly reflective surface, such as a window, or a welder 
at work; the latter a car in the foreground (as happened 
during the study). Either causes the camera to close down 
its aperture because of the perceived light increase, and 
causes the recorded image to be dark. 

The manual adjustment of the light level allows the
user to ensure that the light reflected from the surface of
interest remains fairly constant. This requires that the overall
lighting remain constant. If the period of observation is
short, and assuming natural light and a relatively clear
sky, little change in overall lighting is likely. In this
ideal situation it is a simple matter to use the fixed,
manually set iris found on any standard camera. In the more
common situation, it will be necessary to determine
independently from the camera what the actual ambient
lighting level is, through the use of a properly shielded
and directed light meter, and thereby set the proper iris
manually or electronically. The latter system would simply
be a customized automatic light level adjustment, but would
overcome the problems inherent in the standard through-the-lens
system. Again, as discussed previously, minor changes
in lighting can be compensated for by reducing the sensitivity
on a better grey level system.


Videotape Characteristics

Video cassette recorders (VCRs) are designed to produce 
a color video signal which, while carrying the same informa- 
tion as the standard RS-170 signal, also carries the extra 
information that is used to produce a color image. As noted 
above, most frame grabbers are equipped to filter out the 
extra information. 

Resolution. The resolution of a videotaped image 
appears to be comparable to the direct input of the camera. 
The experimental results described above, using the compari- 
son program, would seem to support this assumption. The 
variations in the videotaped image that were imperceptible 
to the human eye were picked up by the PCEYE system, but 
were acceptable. 

Distortion. Another inherent problem with using the
videotape medium is the susceptibility of the tape to wear,
causing degradation of the signal output, and the mechanical
wear of the VCR mechanism, causing an increase of line
noise. Such degradation and interference can be controlled
by limited reuse of videotape, and proper maintenance
(especially head cleaning) of equipment.

CHAPTER 3


POSITION OF CAMERAS/SITE TESTING


Nature of Observed Object 


The nature of the observed object is critical. Its nature
is its general shape, dimensions, finishes, edges,
depth, and visual accessibility (unobscured).

Humans can observe an object, and subconsciously "fill
in the gaps." The computer cannot do this, so it is
necessary for the entire image to be captured by the camera
or other equipment in use. The ability to capture that
entire image is directly related to and restricted by the
capabilities of the equipment in use. In this specific
case, the equipment is a standard video camera with a zoom
lens. The zoom lens allows great flexibility in locating
the camera so the entire image can be captured and the
effects of distortion caused by being too close to a large
object can be minimized. It can also introduce other
errors, as will be discussed later in this chapter.

Another factor which will come into play is the nature 
and orientation of lighting. People have a much more con- 
tinuous discernment of light levels than the computer, and 
are therefore more tolerant of poor lighting conditions. 
Where these poor lighting conditions exist, for instance
in dark shadows, proper analysis will be difficult.

Volume, Area, or Length

A vertical area, such as a wall, provides the greatest 
ease of observation. The reason for this is the great flex- 
ibility available in camera setup positions. As will be 
demonstrated, the error incurred by varying the position of 
the camera, within the operational envelope, is minimal and 
easy to calculate (see Chapter 4 for errors introduced by
camera position). This is also true for the error introduced
by the distance differences of the extremes of the
object. Furthermore, as will be discussed in Chapter 4, it
is a simple matter to make a direct correlation between the
number of pixels in the digitized image and the actual area 
observed. 

These statements concerning vertical areas can be
extended to horizontal areas, provided that the camera can be
positioned within the corresponding operational envelope.

The system may be applied very simply to calculate
lengths by establishing the endpoints of the object. Or, if
the width of the object is known, the area calculations can
be manipulated to yield the length.
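
The area and length relations can be sketched as follows. This is an illustrative simplification, assuming a planar surface viewed square-on with a known scale in each screen direction; the function names and sample values are hypothetical, and the errors this ignores are the subject of Chapter 4.

```python
def area_from_pixels(pixel_count, ft_per_pixel_horizontal, ft_per_pixel_vertical):
    """Area (square feet) represented by a count of object pixels.

    Each pixel is treated as a small rectangle whose sides are the
    horizontal and vertical scales at the object plane.
    """
    return pixel_count * ft_per_pixel_horizontal * ft_per_pixel_vertical


def length_from_area(area_sq_ft, known_width_ft):
    """Length (feet) of a strip-like object whose width is known."""
    if known_width_ft <= 0:
        raise ValueError("known_width_ft must be positive")
    return area_sq_ft / known_width_ft


if __name__ == "__main__":
    area = area_from_pixels(1000, 0.5, 0.5)
    print(f"area: {area} sq ft")
    print(f"length of a 10 ft wide strip: {length_from_area(area, 10.0)} ft")
```

This is how a pixel count of a pavement strip of known width, for example, could be turned into lineal feet in place.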

The most complex feature to observe is a volume. It is 
probable that proper analysis of a volume is beyond the 
capability of a pattern recognition system of the nature now 
under consideration. Volume calculations are most often
associated with earthmoving operations, and are therefore
probably in the realm of photogrammetry, at least where the
limitation is to visual clues.


Planar or Curved or Noncoplanar 

While actual observation of a complex geometry is no
more difficult than that of a simple plane, translation of
such an image into useful data for measurement of area or
length is more difficult. The simplest case of a planar object
allows direct translation of the digitized image to actual
area. When the object is not planar, the user must break
the image into segments that the computer will handle
individually.

Identification/Description of
Selected Structures for Study

Initial observations were collected at construction
sites in State College, Pennsylvania. These included a
small office building and a hotel. It was determined that
these construction sites introduced too many variables at
this stage of study. However, the images obtained clearly
illustrated the potential and the problems of using a vision
system. Figure 3.1 is a PCEYE image showing a portion of a
small office building under construction. It was possible
to see through to the other side of the building; this is
evident in Figure 3.1, where the scaffolding on the back of
the building appears in the image.

BUILDING UNDER CONSTRUCTION SHOWING
EFFECT OF BACKLIGHTING AND SCAFFOLDING
FIGURE 3.1

Figure 3.2 shows images developed from the Atherton
Hotel, State College, at different stages of its construc- 
tion. It can be plainly seen that a simple vision system 
here has the capability to detect differences in the areas 
of work in place. In Figure 3.2(a), there are large areas 
of wall that show bare studs, with only the windows 
installed. The jagged appearance of the roofline in this 
figure is the result of plastic sheeting, draped over the 
edge of the roof to keep out the rain, blowing in the wind. 
Figure 3.2(b) clearly shows that the sheathing has now been 
installed, and the plastic sheeting has been removed. 

The southwest facade of Centre Community Hospital,
State College, Pennsylvania, was finally chosen for this
study. This subject is of the simplest nature, as described
above, in that it is planar, for the most part, and is
relatively unobscured by extraneous materials, such as
foliage, plastic sheeting, scaffolding, etc. In addition,
it provides a limited variety of construction materials for
observation.


Nature and Orientation of Illumination


Natural Light (Sunlight)


Needless to say, the position, intensity, color, and,
in fact, existence of natural light are completely uncontrollable.
These characteristics can vary greatly from day
to day, and will over periods of months.

All observations for this project were taken under conditions
of natural lighting. As stated, the facade under
consideration has a southwest exposure. For reference, the
observations were taken in mid-January.

Time of Day. Observations were taken at three times on
a single day: 9:00 a.m., 12:15 p.m., and 3:30 p.m. These
times were chosen to center the observations between local
sunrise and sunset. The 9 a.m. footage shows a distinct
shadow line across the facade not evident in either of the
other observations. This shadow line moves noticeably in
the short (approximately fifteen minute) period of filming.
As foreseen above, the computer cannot discern the continuation
of the wall into the shadow line; this could introduce
a large error over the time span when the shadows move
rapidly, i.e. early morning and late afternoon.

Weather. On the day of observation, the weather was
officially described as partly cloudy, but was subjectively
judged to be a very clear day. Temperatures ranged from the
low teens to the mid-thirties (degrees Fahrenheit) through
the day. These temperatures were below the recommended
operational ranges of the videotape equipment, and although
performance was satisfactory, the rechargeable batteries for
the system required frequent changing.

Orientation. The orientation of the sun varied greatly
between the three observation periods. The three times of
observation were chosen intentionally to demonstrate the
differences introduced by the changed angle of incidence of
the sunlight upon the building face. The specifics of how
this orientation will change are also dependent upon the time
of year.

As can be seen in the morning image (Figure 3.3), the 
sun was low and to the southeast of the building. The adja- 
cent hospital wing, on the right, is casting the large 
shadow across the observed face. The noon observation 
(Figure 3.4) placed the sun almost directly behind the 
observation camera. This caused the reflection from the 
buff brick surfaces to saturate the image, reducing the 
contrast. The late afternoon image (Figure 3.5) shows more 
contrast than the noon image, but the nature of the 
illumination is also different from that observed in the 
morning. The overall light level is higher than in the 
morning image. Additional images from the three observation
periods can be found in Appendix A.


Artificial Light

The use of artificial light has both advantages and 
disadvantages. Its primary advantages are: the light level 
is constant and controllable; the orientation of the light 
source is fixed and controllable; and additional lighting 
can be added to minimize shadowed areas. The primary
disadvantages are: artificial lighting is expensive; and it
can cause harsh shadow lines which the computer cannot "see"
through.

Orientation with respect to object. Control of the
orientation of artificial lighting was cited as a primary 
advantage in the previous section. This, of course, is in 
contrast to natural light variations and the weather. 

There are three overall orientations which can be
addressed for the illumination of a building exterior. The
first two involve lighting attached to the building itself
and directed either up from the bottom, or down from the
top. This scheme of lighting is acceptable in a constructed
facility where the sole purpose is to show the building.
However, in a building under construction, the purpose of
illumination is to enable the craftsman to carry out a task.
Lighting in the first configuration will likely restrict
access to the structure. In the second configuration, the
light will likely be in the workers' eyes. In either case,
the lighting is not an aid but a potential safety hazard.

The third, and preferable, configuration is for the 
lighting equipment to be located some safe distance away, 
and the light directed back onto the structure. The 
distances used are obviously dependent upon the size of the 
building, the site conditions, and the capabilities of the 
equipment. Also, the lighting should be placed at whatever
height best illuminates the areas of interest, as far as is
physically possible.
The artificial illumination used to see the exterior of
the building is not the only case of concern in capturing 
images. As can be seen in the prints of the images (Figures 
3.3, 3.4, 3.5, and Appendix A), lighting inside the building 
drastically changed through the course of the day. In the 
observed case, the changing interior lighting was the result 
of light-colored curtains in the windows being opened and 
closed, and reflecting the sunlight. This is an excellent 
illustration of the corresponding effect that interior 
lights on a construction site would have on the image. In 
the worst case, it would adversely affect the overall
lighting level, and degrade the usefulness of the image for 
comparison. Much the same effect can be observed in the 
situation where unshielded welding is ongoing. 

Another form of "artificial" light which should be con- 
sidered is backlighting. If the building is at a stage 
where you can see all the way through it, the light coming 
through is indistinguishable from the light reflected from 
the structure. This was clearly illustrated in Figure 3.1. 

Interference and/or Combination with Available Natural
Light. The relationship between natural and artificial
light could be most beneficial when the artificial is used
to fill in the gaps, or shadows, left by the natural. In
bright daylight this is unrealistic, but in the early
morning or evening, or in heavily overcast conditions, this
could be of great benefit. The gap in this hypothesis is
that these periods already require artificial lighting due
to the low natural lighting level.


As described earlier, the differentiation program 
showed a regular error, even when observations were taken 
very close together, the camera was not moved, and even when 
the observations were taken under artificial lights in the 
controlled setting of the laboratory. The error was larger, 
but still fairly consistent, when the additional variable of 
natural light in the field was introduced. The basic error 
level may be the result of inconsistency by PCEYE when 
assigning the grey scale values to an image. We cannot pre- 
dict this error ahead of time, but we can determine it under 
each new set of field conditions, and account for it in our 
calculations. 
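
The idea of measuring the basic error under each set of field conditions and then accounting for it can be sketched as follows. This is a minimal illustration in Python, not the differentiation program used in the study. The function names, the nested-list image representation, and the simple subtraction of the baseline error from the measured difference are all assumptions made for the example.

```python
def difference_fraction(image_a, image_b):
    """Return the fraction of pixels whose grey level differs between two images."""
    total = 0
    changed = 0
    for row_a, row_b in zip(image_a, image_b):
        for pixel_a, pixel_b in zip(row_a, row_b):
            total += 1
            if pixel_a != pixel_b:
                changed += 1
    return changed / total


def corrected_change(baseline_pair, observation_pair):
    """Subtract the baseline error (two frames of an unchanged scene)
    from the difference measured between two observation frames."""
    baseline = difference_fraction(*baseline_pair)
    measured = difference_fraction(*observation_pair)
    # Clamp at zero: a difference below the baseline is treated as no change.
    return max(0.0, measured - baseline)


# Hypothetical 2 x 4 images with grey levels 0 to 3.
still_1 = [[0, 1, 2, 3], [3, 2, 1, 0]]
still_2 = [[0, 1, 2, 3], [3, 2, 1, 1]]  # one pixel differs with no real change
later = [[0, 1, 3, 3], [3, 3, 1, 1]]    # three pixels differ from still_1

baseline_error = difference_fraction(still_1, still_2)
net_change = corrected_change((still_1, still_2), (still_1, later))
print(baseline_error, net_change)
```

In this sketch the baseline pair would be two frames captured moments apart at the start of each field session, so that the correction reflects that day's lighting and equipment behavior.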

The problem in using a differentiation type program is
that the compared images must be matched precisely. This is
known as image registration. Using standard video equipment,
daily camera setups are impossible to duplicate. The
actual precise placing of a standard video camera is as
simple as using a plumb bob; the key for this application is
directing the camera so as to precisely duplicate the
captured image. For this exact duplication to be achieved,
it will be necessary to use methods of precision optical
alignment, common in surveying; this must be done through
the lens or through other optics attached to the camera, and
referenced to a fixed object. From that reference, it will
be necessary to have the capability of rotating the camera
in the vertical and horizontal planes to the desired
position (as with a transit or theodolite). This will
likely entail permanent modification and customization of
equipment, as the standard through-the-lens image lacks the
precision necessary.


Range Targetable at One Time (Depth of Field)

The primary limitation evident in the depth of field 
variable of the video camera is the randomness with which 
the camera focused on the object under wide angle view. For 
example, at distances up to 300 feet, the automatic focusing 
mechanism systematically focused at three to six feet, 
instead of the "infinity" range expected. The visual image 
does not suffer appreciably, but the actual result is much 
more homogeneous to the computer. This is acceptable for 
gross estimates of area, or for simple counting, using the 
current system. This would not be acceptable for work
requiring higher precision, such as applications in quality
control.


Proximity to Object Relative to Object Size

The immediate result of collecting data at close range
to a tall (or long) object is the familiar distortion
of a tall building narrowing at the top, or the roadway
narrowing in the distance. This distortion decreases as the
relative distance from the object increases.

The origin of this distortion is the nature of the 
"field of view" of the viewing device, this being the human 
eye or the camera lens. Field of view is expressed in terms 
of an angle. When this angle is a constant, as it is for 
any one focal length of a camera, or our eyes, the actual 
area covered by the field of view is dependent on the 
distance of the object from the observer. As the distance 
increases, the actual width or height of the field of view 
increases proportionally. 
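
The proportional growth of the covered width with distance can be illustrated with a short Python sketch. It uses the standard relation between an angular field of view and the linear extent it covers; the 40-degree field of view and the distances are assumed values for illustration, not measurements from this study.

```python
import math


def field_of_view_width(distance, fov_angle_deg):
    """Linear extent covered by an angular field of view at a given distance."""
    return 2.0 * distance * math.tan(math.radians(fov_angle_deg) / 2.0)


# Hypothetical lens with a 40-degree horizontal field of view.
for distance_ft in (50, 100, 200):
    width_ft = field_of_view_width(distance_ft, 40.0)
    print(f"{distance_ft:4d} ft away -> {width_ft:6.1f} ft wide")
```

Doubling the distance doubles the width covered, so an object of fixed size occupies half as much of the frame.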

In the present case, for illustration, we can assume 
that the object has constant width. As the distance to the 
object increases, the object fills less of the field of 
view. The observer perceives this as the object becoming 
narrower at the top or in the distance. The greater the 
difference between the near and far points on the object, 
the greater the distortion. The geometry of this situation
is illustrated and analyzed in Chapter 4.


Role of Zoom Lens at Different Ranges 


As briefly stated above, the use of a zoom lens can 
greatly increase the flexibility with which we can choose a 
site for the camera setup. It can give us a virtually
unlimited number of locations from which the image is
obtainable. In addition, it can be a primary tool for
fighting the visual distortion described in the previous
section. With a zoom or telephoto lens, the observation can
be made from a greater distance, thus reducing the
distortion, and will still capture the entire image desired.

These advantages of the zoom lens are not without their
complications, however. While the flexibility of the zoom 
lens is ideal for the construction site, where the observer 
may be required to change position regularly, it also 
introduces another, imprecisely controllable variable. In 
the common commercially available video camera, the focal 
length markings on the lens are vague at best. There are no 
"click stops" which would indicate precise settings for 
focal length. The best the user could hope for is that the 
highest and lowest settings are reasonably reliable; judging 
by the performance of the model used for data gathering, 
this is a poor assumption. The zoom mechanism gives a very 
poor response to manual adjustment. 

For these reasons, it is recommended that great effort
be made to establish fixed locations from which to gather
data, and hence to use one of several fixed focal length
lenses. If the advantages of the zoom lens are of great
need to the user, the zoom must have precise metering for
rapid and correct determination of focal length.





CHAPTER 4 


PROCESSING OF DATA


Relationship of Pixels to Object Size 
The computer does not report the number of square feet. 
It reports the number of pixels in the digitized image. 
Users must understand that due to the conditions on the
site, it may be necessary to obtain data from different
positions. Each camera position will have its own unique
relationship to the building, which will determine what the
number of pixels equates to relative to the structure being
measured. Each unique pixel ratio must be provided to the
computer for analysis. There are several alternatives to
accomplish this. These alternatives will require the
creation or acquisition of software with different
capabilities than the system employed in this study.


Manual Input

Manual input is probably the simplest method for
informing the computer of the variables applicable to
individual sets of acquired images. This requires that
specifics relative to the controlling variables be provided at
the prompting of the system.

First, some object of fixed dimension within the frame
must be identified, from which the computer can derive the
ratio of pixels to feet at a given point in the image. This
identification can be accomplished by using the cursor keys
or a "mouse" to establish the endpoints of the object, which
could be as simple as a surveyor's range pole. This standard
must be at the same distance as the object of the
study, or both distances must be known and provided.
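
Deriving the ratio of pixels to feet from the two marked endpoints of a reference object can be sketched in Python as below. The function names and the numerical values are hypothetical, and the sketch assumes square pixels and a reference object roughly parallel to the image plane; it is an illustration of the step, not the software used in this study.

```python
import math


def pixels_per_foot(end_a, end_b, known_length_ft):
    """Scale at the reference object from its two marked (row, column) endpoints."""
    pixel_length = math.hypot(end_b[0] - end_a[0], end_b[1] - end_a[1])
    return pixel_length / known_length_ft


def scale_at_distance(scale_at_reference, reference_distance_ft, object_distance_ft):
    """Adjust the scale when reference and object are at different distances.

    The field of view widens in proportion to distance, so the number of
    pixels per foot falls off in inverse proportion.
    """
    return scale_at_reference * reference_distance_ft / object_distance_ft


# Hypothetical: an 8 ft range pole spans rows 150 to 70 in column 160.
scale = pixels_per_foot((150, 160), (70, 160), 8.0)
# Hypothetical: the pole stands 100 ft from the camera, the wall 200 ft.
wall_scale = scale_at_distance(scale, 100.0, 200.0)
print(scale, wall_scale)
```

The second function covers the case where the standard is not at the same distance as the object of the study and both distances are supplied.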

In addition, it will be necessary to provide such 
variables as distance from the object, focal length of the 
lens, the vertical (elevation) angle of the lens, and the 
camera relationship to the centerline of object. 

The necessity of the first variable (reference object)
is straightforward, but qualified in that the ratio
determined is in fact accurate only at that point on the screen.
Deviation from the determined ratio increases with the
distance from that point.

As discussed in Chapter 3, the angular field of view of 
a camera is fixed, but the actual dimensions covered depend
directly upon the distance of the object from the lens. As 
the distance increases, the field of view grows, so an 
object of fixed dimension appears to shrink. This explains 
why a tall building gets narrower at the top. 

It is necessary to quantify this effect if a true
output is to be obtained from the system. The ratio of
pixels to square feet is variable, and dependent upon the
location of any particular point on the screen. This is
controlled by the six-dimensional relationship of the camera
to the object (x, y, z, inclination, horizontal angle, and
tilt). This relationship is illustrated in Figure 4.1.
The first step in defining the equations which will
determine the ratio is to define the variables to be
discussed:

$D_h$ = horizontal distance from camera to object

$h_0$ = height of reference object

$h$ = height of object with respect to camera

$D_0$ = distance to reference object

$D$ = distance to any point in the image

$R_0$ = screen row of the fixed dimension given

$\alpha_v$ = the vertical angle of the camera

$\theta_v$ = the vertical angular field of view

$C_0$ = screen column of the centerline of fixed dimension

$\alpha_h$ = horizontal angle of the camera with respect to
perpendicular from building centerline

$\theta_h$ = the horizontal angular field of view

$\delta$ = angular separation from reference point


These are illustrated in Figures 4.2, 4.3, and 4.4. 

The ratio of the sizes of the fields of view at any two
points moving up or down a structure will determine the
ratio of the pixel "sizes" at those levels.

In Figure 4.2, the dimensions designated as $H_0$ and $H$
are physically the same. In this example, however, the
distances to those two points are different, designated by
$D_0$ and $D$ respectively. These values, shown in Figure 4.3,
are:


$$ D_0 = \sqrt{D_h^2 + h^2} \quad \text{and} \quad D = \sqrt{D_h^2 + (h + dh)^2} \tag{4-1} $$

But $h = D_h \tan\alpha_{v0}$ and $dh = D_h(\tan(\alpha_{v0} + \delta) - \tan\alpha_{v0})$;
therefore, by substitution,

$$ D_0 = D_h \sqrt{1 + \tan^2\alpha_{v0}} = D_h \sec\alpha_{v0} $$

and

$$ D = D_h \sqrt{1 + \tan^2(\alpha_{v0} + \delta)} = D_h \sec(\alpha_{v0} + \delta). \tag{4-2} $$

Since the field of view is represented by

$$ FOV = 2 D \tan(\theta_h / 2) \tag{4-3} $$

the ratio of the two fields of view shown above is

$$ \text{FOV ratio} = \frac{FOV}{FOV_0} = \frac{2 \, (D_h \sec(\alpha_{v0} + \delta)) \tan(\theta_h / 2)}{2 \, (D_h \sec\alpha_{v0}) \tan(\theta_h / 2)} \tag{4-4} $$

which simplifies to:

$$ \text{FOV ratio} = \frac{\sec(\alpha_{v0} + \delta)}{\sec(\alpha_{v0})} = \frac{\cos(\alpha_{v0})}{\cos(\alpha_{v0} + \delta)} \tag{4-5} $$
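
As a numerical illustration of the cosine form of this ratio, take a reference point sighted at 30 degrees above the horizontal and a second point separated from it by a further 10 degrees. These angles are assumed for the example only and are not taken from the field data.

```latex
\text{FOV ratio}
  = \frac{\cos(30^\circ)}{\cos(30^\circ + 10^\circ)}
  = \frac{0.8660}{0.7660}
  \approx 1.13
```

Under these assumptions, a pixel at the upper point spans about 13 percent more of the structure than a pixel at the reference point.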


This field of view ratio, which will also be termed the 
Pixel Ratio, will act as a multiplier. By equating the 
vertical screen dimension to a visual angle, the angular 
difference between the various points of interest, $\delta$, can be
expressed in terms of the screen row of the latter (D). 

The system specifications will determine whether the 
camera image will fill or fit on the screen. Vertical and 
horizontal directions must be addressed separately. PCEYE 
clips the image in both directions. For the purpose of this 
derivation, $\theta_v$ has been chosen as the vertical angle of
equivalence, as if the vertical dimension of the image has been
made to fill the screen vertically.

$$ \delta = \frac{R_0 - R}{200} \, \theta_v \tag{4-6} $$

The field of view ratio now becomes the Pixel Ratio by
substitution for $\delta$ in the FOV ratio equation (4-5).

$$ PR_v = \text{PIXEL RATIO} = \frac{\cos(\alpha_{v0})}{\cos\left(\alpha_{v0} + \frac{R_0 - R}{200} \, \theta_v\right)} \tag{4-7} $$
The v subscript indicates that this pixel ratio adjusts for
the distortion due to the differing distance in the vertical
direction. A similar derivation, for the horizontal
direction, is contained in Appendix B. The variables involved
are illustrated in Figure 4.4. $C_0$ represents the centerline
of the reference known dimension. The resultant horizontal
pixel ratio is:

$$ PR_h = \frac{\cos(\alpha_{h0})}{\cos\left(\alpha_{h0} + \frac{C - C_0}{320} \, \theta_h\right)} \tag{4-8} $$

The total pixel ratio, the product of $PR_v$ and $PR_h$, then
gives the comprehensive multiplier that can be applied to
any point on the screen to determine the physical area to
which it equates, based on its (Row, Column) coordinates. In
order to put $PR_t$ in terms of the given parameters, it must
be recognized that:



















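$$ \alpha_{v0} = \alpha_v + \frac{\theta_v}{2} - \frac{R_0}{200} \, \theta_v = \alpha_v + \frac{100 - R_0}{200} \, \theta_v \tag{4-9} $$

and

$$ \alpha_{h0} = \alpha_h - \frac{\theta_h}{2} + \frac{C_0}{320} \, \theta_h = \alpha_h + \frac{C_0 - 160}{320} \, \theta_h \tag{4-10} $$

Substituting into the $PR_v$ and $PR_h$ equations, $PR_t$ becomes a
function of Row and Column:

$$ PR_t = \frac{\cos\left(\alpha_v + \frac{100 - R_0}{200} \, \theta_v\right) \cos\left(\alpha_h + \frac{C_0 - 160}{320} \, \theta_h\right)}{\cos\left(\alpha_v + \frac{100 - R}{200} \, \theta_v\right) \cos\left(\alpha_h + \frac{C - 160}{320} \, \theta_h\right)} \tag{4-11} $$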


In this section, it has been demonstrated that the com- 
puter may calculate areas by counting the necessary pixels 
and applying their unique multipliers to obtain a measure of 
the actual area. That multiplier can vary widely with the 
conditions and the values of the multiple variables. 
Appendix C contains a short computer program and several 
runs showing the range of multipliers for different 
scenarios using the equipment available. Searching a range
of zero to forty-five degrees for $\alpha_v$ and $\alpha_h$, the maximum $PR_t$
was 2.6818173 and the minimum 0.23047097. It should be
noted that this is the worst case scenario, and resulted with
both angles equal to forty-five degrees, and the lens at its
minimum focal length (maximum $\theta_h$, $\theta_v$).
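
The kind of search described above can be sketched in Python as follows. This is not the program in Appendix C. It assumes a 200-row by 320-column screen with row 0 at the top, a total pixel ratio formed as the product of a vertical and a horizontal cosine ratio, and fields of view of 30 and 40 degrees, which are hypothetical values rather than the specifications of the lens used. It is therefore not expected to reproduce the figures quoted in the text.

```python
import math

ROWS, COLUMNS = 200, 320  # assumed screen dimensions


def pixel_ratio(row, col, ref_row, ref_col, alpha_v, alpha_h, theta_v, theta_h):
    """Total pixel ratio at (row, col) relative to the reference point.

    All angles are in degrees.
    """
    def angle_v(r):
        # Sight angle of a screen row: camera angle plus offset from mid-screen.
        return math.radians(alpha_v + (ROWS / 2 - r) / ROWS * theta_v)

    def angle_h(c):
        # Sight angle of a screen column, measured from the camera axis.
        return math.radians(alpha_h + (c - COLUMNS / 2) / COLUMNS * theta_h)

    vertical = math.cos(angle_v(ref_row)) / math.cos(angle_v(row))
    horizontal = math.cos(angle_h(ref_col)) / math.cos(angle_h(col))
    return vertical * horizontal


def ratio_range(alpha_v, alpha_h, theta_v, theta_h, ref_row=100, ref_col=160):
    """Smallest and largest ratio over the whole screen for one camera setup."""
    ratios = [
        pixel_ratio(r, c, ref_row, ref_col, alpha_v, alpha_h, theta_v, theta_h)
        for r in range(ROWS + 1)
        for c in range(COLUMNS + 1)
    ]
    return min(ratios), max(ratios)


# Hypothetical fields of view; camera angles of 0 and 45 degrees.
for angle in (0, 45):
    low, high = ratio_range(angle, angle, theta_v=30.0, theta_h=40.0)
    print(f"camera angles {angle:2d} deg: ratio from {low:.3f} to {high:.3f}")
```

The spread between the smallest and largest ratio widens as the camera angles increase, which is the behavior the worst case above describes.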





"Automatic" Input

The previous section discussed the steps to be taken by 
the computer, given a required input by the user. It would 
be preferable if the user did not have to repeatedly take 
manual measurements, and then report them to the computer. 
There are several methods that may preclude this necessity 
by automating part or all of this particular step in the 
data collection process. These include incorporation of: 
radio frequency (RF) triangulation for camera position; a
fixed three dimensional target; and utilization of "total
station" surveying technology.

RF Triangulation. Simple methods of triangulation can
be used to pinpoint the location (x, y, z) of the camera.
This data could be stored directly on the videotape and read
by the computer at the beginning of any session.

Three Dimensional Target. A three dimensional target
of known size, location, and features would serve the
purpose of providing a sufficiently intelligent and
discriminating system with all the necessary variables described
above except one. It would still be necessary for the
range or the focal length of the camera to be provided
separately. This separate data input may also be automated
through electronic reporting of focal length or electronic
range finding.

"Total Station" Technology. A total station is an 


electronic surveying instrument that has the capability to
electronically record all measurements taken. The (x, y, z)
coordinates of the station can be determined by following
any equipment setup with readings taken of points of known
coordinates and elevation. The horizontal and vertical
angles used to sight these points act as a base of reference
for future observations. The distance to the object is
determined by the total station using infrared rangefinding.

It is feasible for our camera equipment to be mounted in
concert with the total station so that the two move as one,
the camera being a known distance above the total station
reference.
How Does Productivity Come From This? 


At this point it is necessary to review and address the
origin of this specific research project. The purpose
intended for the introduction of computer vision to
construction was automated collection and processing of
productivity data.

We now know that the information necessary can be 
collected on videotape and fed to a computer that should be 
able to determine the actual dimensions of the object being 
observed. This necessarily includes the capability to have
the video input directly to a computer on site. This has
the advantage of providing more immediate data for use.

Determination of straight units-per-hour productivity
is a simple matter of the computer making observations at
determinable intervals and comparing, or subtracting, the
characteristics of subsequent images. Again, this can be
done real-time on the job site, or back in the office at
normal speed or fast-forward.
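
The subtraction of successive observations to obtain a units-per-hour figure can be sketched as below. The function names, the observation values, and the use of a single average square-feet-per-pixel scale (in place of a separate multiplier for each pixel) are simplifying assumptions made for this illustration.

```python
def area_in_place(pixel_count, square_feet_per_pixel):
    """Convert a count of completed-work pixels to square feet with one average scale."""
    return pixel_count * square_feet_per_pixel


def productivity(observations, square_feet_per_pixel):
    """Square feet placed per hour between successive observations.

    observations: list of (elapsed_hours, completed_pixel_count) in time order.
    """
    rates = []
    for (t0, p0), (t1, p1) in zip(observations, observations[1:]):
        placed = area_in_place(p1 - p0, square_feet_per_pixel)
        rates.append(placed / (t1 - t0))
    return rates


# Hypothetical observations: elapsed hours and pixels classified as finished wall.
samples = [(0.0, 1200), (2.0, 1800), (4.0, 2100)]
rates = productivity(samples, square_feet_per_pixel=0.25)
print(rates)
```

Dividing each rate by the crew's manhours for the interval would give the manhours-per-square-foot form of productivity.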

It is possible that with the proper feedback and soft- 
ware, the computer will be able to carry out the operation 
with little or no supervision. The overriding constraint
on this possibility at this time is the limited ability of the
computer vision system to properly discern between different
areas that may provide similar digital patterns. This was
observed in several instances among the images obtained for
this project.

For the foreseeable future, until the systems available 
for use are more refined, and the software is developed, 
these systems will require constant involvement and input by
human operators.
CHAPTER 5


SUMMARY AND CONCLUSIONS 


Summary 


Initiative

The driving motivation behind the entrance into this 
field was the extremely labor intensive methods required for 
productivity data collection. The current methods involve 
manual collection of data which require extensive review and 
correlation before meaningful results can be determined. 

A system is desired with the potential to provide 
real-time productivity analysis, whereby the supervision on 
the job site can more rapidly respond to productivity 
problems, and eliminate those periods of low productivity
which cannot be addressed due to the gap between performance 
and analysis of data. 

The potential for automation of this process can be
seen in light of the advances in the field of computer
science and, more specifically, pattern recognition. This
potential has accomplished much in the manufacturing field
in high-speed quality control inspection of products
on the assembly line.
Approach

The basic approach taken in this research project was
to identify those features unique to the construction site,
and to explore the potential barriers in physical
characteristics and technology. The predominant factors under
consideration are the matters of lighting, distance, and
[illegible] of object.

The initial step was to become familiar with the con- 
cepts of computer vision and the basic equipment available 
for use. 

Observations were then taken at several construction 
sites. However, it was found that the dynamic nature of the
facilities under construction introduced too many variables 
for proper analysis at this time. 

After these initial construction site attempts, an
existing structure was settled upon for the remainder of
the study. This existing facility provided a limited number
of materials and a fairly unobscured view, thus reducing the
variables in the equation. Observations were then made from
three separate distances, at three times of day, at two
focal lengths, to determine and illustrate the effects of
the differences in lighting (intensity and orientation) and
distance (under varied magnification).
Major Problems Encountered

There were many problems encountered during the process 
of applying the system to the building environment. Tables 
5.1 and 5.2 contain a summary of the physical and techno- 
logical barriers encountered, respectively. The information 
was put in this format in order to provide the reader a 
succinct and complete overview of the perceived problems.
These Tables should be able to stand alone.
TABLE 5.1
Physical Barriers Encountered 


The existence of physical obstructions 
throughout the printed images is readily 
apparent. These include trees and shrubs, 
signs, and lampposts. 


The least disruptive method for removing 
these obstacles from the image is to move 
inside the obstacles; this is not always 
possible. 


Physically remove the obstacle. In most 
cases this may be infeasible, in the case of 
large equipment, or unpopular, as for trees. 


Develop image "subtraction" routine which
can recognize an obstacle as part of the
"landscape" and can filter out the obstacle
and fill in the remaining gaps by
"painting" in the missing object.


Shadows drastically change the character- 
istics of the object being analyzed. Deep 
shadows caused by bright lighting conditions 
are opaque/black to the digitizing system. 


Utilize vision system with greater number of 
grey levels. These systems more discri- 
minately assign grey levels and may have 

the depth to assign non-black values to the 
shadows. 


Fill in the shadows by supplementing natural
lighting with artificial lighting. This
will alleviate the severe underlighting that
causes the sharp shadow and makes the image
black in that region.


The angle of the incident light, directly
dependent upon the time of day and year, or
the location of the artificial lighting, may
cause highly reflective surfaces to appear
much brighter than they actually are, or to
appear to be other materials altogether.


Incorporate color capabilities into the
system to better distinguish individual
fields of interest.


Introduce polarization to reduce the
intensity of light reflecting from polished
surfaces. This will have lesser effect on
rough surfaces where the light is reflected
more randomly.


Study surface treatment characteristics of
construction materials.


Changing Features

Items such as welding shields, scaffolding,
or other temporary facilities which are
moved during construction will randomly
introduce error into the image.


Suggestions

Manually compensate for the error.


Develop algorithms which can recognize such
obstructions and essentially subtract them
from the image, filling in the gaps left
behind with the building.
TABLE 5.2
Technological Problems


This distortion can be of two types: the
narrowing caused by the expansion of the
field of view with distance (e.g. on a tall
building); and the curving found at the
edges of an image caused by imprecision on
the part of the camera lens.


The edge distortion is a direct function of
the construction and quality of the camera
lens; therefore this error can be eliminated
by use of higher quality equipment.


The narrowing effect of distance can be 
corrected by use of special lens additions 
that straighten the inclined lines. 


The narrowing effect can be compensated 
through the use of algorithms which define 
the relative values of pixels at different 
locations on the screen. 


The grossness of a four grey level system 
results in images which can be ill-defined 
and can exhibit some randomness in assign- 
ment of grey levels to regions with 
comparable light intensities. 


Move up to a system with more grey levels. 
The more grey levels the system has, the 
narrower will be the actual range of light 
intensities that will be assigned to any one 
level. This will allow flexibility in 
sensitivity definition on a case by case 
basis. Sixteen or sixty-four grey level
systems should be a sufficient next step.


When the camera position and alignment
cannot be permanent or preserved from day to
day, the existing equipment does not have
the capability for precision in spatial
position (x, y, and z) or in alignment
(elevation, direction, and tilt). The
through-the-lens optics are inadequate for
these purposes.



Customize camera equipment by mounting on a 
calibrated base on a fully adjustable 
tripod, and attaching additional precision 
sights for referencing established points. 


Mount camera in conjunction with existing 
"Total Station" technology to determine 
actual position, rather than attempt to 
establish a desired position, and translate 
the new images to the old position 
algorithmically. 
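
The grey level suggestion in Table 5.2 can be illustrated with a short Python sketch showing how the band of intensities assigned to one grey level narrows as levels are added. The 0 to 255 intensity scale, the equal-width bands, and the two sample intensities are assumptions for the example; they do not describe how PCEYE actually assigns grey levels.

```python
def quantize(intensity, levels, max_intensity=255):
    """Map an intensity in 0..max_intensity to one of `levels` equal-width grey levels."""
    return min(levels - 1, intensity * levels // (max_intensity + 1))


def band_width(levels, max_intensity=255):
    """Width of the intensity range that falls into a single grey level."""
    return (max_intensity + 1) / levels


# Two hypothetical intensities: a deep shadow and slightly lighter shadowed brick.
shadow, shadowed_brick = 20, 50
for levels in (4, 16, 64):
    print(levels, band_width(levels),
          quantize(shadow, levels), quantize(shadowed_brick, levels))
```

With four levels both sample intensities fall into the same (darkest) level, while with sixteen or sixty-four levels they are assigned different values, which is the behavior hoped for in shadowed regions.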
Conclusions


PCEYE as a Learning Tool 

The PCEYE system used in the research served as an 
excellent tool to learn the basics of how a computer vision 
system works. It also helped to spark an interest in what 
the potential for such a system might be through showing 
where its shortcomings were. These limitations are,
however, what keep it from being a feasible system for end
use in this effort.

The primary limitations of the system are its low reso- 
lution, small number of grey levels, and incredibly slow 
processing speeds. The first two deficiencies are simple to 
overcome, as systems exist with higher resolution and more 
grey levels; they are also quite a bit more expensive. The
last is as much a function of the hardware (a standard PC)
as the PCEYE software. Again, this can be overcome by
upgrading to currently available equipment, at much greater
cost.


Adequacy of Video Resolution


The resolution of the video images obtained was
adequate for the purpose of this research, but for actual
practice it may be desirable to use more precise equipment,
particularly in the areas of focusing and focal length
determination in a zoom application. It must be kept in
mind that the video camera and equipment used were marketed
for home movies for human observers, and are not expected to
produce as precise an image as we desire for our automated
processing methods.


Feasibility
The successful construction of a system that automates
the collection and processing of visual data for the purpose
of productivity analysis is the goal. This will require the
extended attention and cooperation of experts in the fields
of computer science (and expert systems), pattern
recognition, vision systems, and construction and productivity.
The long range goals of this research are feasible.
This should be qualified, however, by saying that it will
not be done overnight.


Recommendations for Future Direction


There are several areas that warrant immediate consi- 
deration as the next steps into adapting vision systems to 
the construction industry are considered. The overriding 
and required preliminary step to any of these items, how- 
ever, is the specific requirement for a new "level" of 
equipment to be acquired. This equipment should have higher 
resolution, higher discrimination of light intensities, and 
higher speed. 

The first of these "next steps" or items of consideration
is the effect or contribution that the different colors
and finishes of construction materials make to the production
of an image. As shown in Chapter 3 and Appendix A,
similar patterns in the processed image can be produced by
different materials. By studying the effects of colors and
finishes, in conjunction with incident light characteristics,
the error involved in equating different materials can
be recognized and controlled.

The second item which bears investigation is the 
problem encountered in trying to see into the shadows. It 
is hoped that a more discerning grey scale breakdown will 
alleviate this problem. 

The third item, which was not covered in any depth to
this point, is the effect of physical obstructions on the
construction site, particularly those, such as scaffolding,
plastic sheeting, and other temporary structures, whose
position is constantly or periodically changing. Immovable
obstructions, such as trees and other buildings, may be
compensated for throughout the study, but when the obstruction is
mobile, the compensation can no longer be manual due to the
time involved. The comprehensive effect on the image has
not been determined at this time.

The fourth, and probably biggest, step for consideration 
is the introduction of this system to interior areas of the 
construction site. To date this work has been directed at 
inspecting exterior walls. It is another matter altogether 
for a system to inspect an interior setting where distances 
are much smaller in proportion to the items to be observed, 
requiring a wider angle lens (extremely short focal 
length). In addition, the camera cannot be stationary and 
still obtain the necessary images, but its positions must 
still be determinable for subsequent images to be compared. 
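
The need for a wider angle lens indoors can be illustrated with the 
field of view relation (B-3) from Appendix B, FOV = 2 D tan(sigma/2), 
solved for the angle sigma. The Python sketch below is only an 
illustration: the function name and the example width and distances 
are hypothetical, and the 44 degree limit is taken from the comment in 
the PRH program listing describing the camera used for this research. 

```python
import math


def required_fov_deg(width, distance):
    """Angular field of view (degrees) needed to span `width` at `distance`.

    Inverts FOV = 2 * D * tan(sigma / 2); width and distance share one unit.
    """
    if width <= 0 or distance <= 0:
        raise ValueError("width and distance must be positive")
    return math.degrees(2.0 * math.atan(width / (2.0 * distance)))


# Hypothetical case: the same 10-unit-wide wall seen from outside and inside.
CAMERA_MAX_FOV_DEG = 44  # widest horizontal FOV quoted in the PRH listing
for distance in (30, 3):
    angle = required_fov_deg(10, distance)
    print(distance, round(angle, 1), angle <= CAMERA_MAX_FOV_DEG)
```

Under these assumed dimensions the exterior view fits within the 
camera's range, while the interior view calls for an angle well beyond 
it. 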
This appendix contains the derivation of the horizontal 
pixel ratio used in Chapter 4. The variables used in the 
derivation are as follows: 

$D_h$ = horizontal distance from camera to object 

$b$ = offset distance to reference object (perpendicular to $D_h$) 

$D_0$ = distance to reference object 

$D$ = distance to any point in the image 

$C_0$ = screen column of the centerline of fixed dimension 

$\alpha_{h0}$ = horizontal angle of the camera with respect to 
perpendicular from building centerline 

$\sigma_h$ = the horizontal angular field of view 

$\delta$ = angular separation from reference point 

All variables used in this derivation are illustrated in 
Figure 4.4. 


$$D = \sqrt{D_h^2 + (b + db)^2} \tag{B-1}$$

But $b = D_h \tan(\alpha_{h0})$ and 
$db = D_h\,[\tan(\alpha_{h0} + \delta) - \tan(\alpha_{h0})]$; therefore, 
by substitution: 

$$D_0 = D_h \sqrt{1 + \tan^2(\alpha_{h0})} = D_h \sec(\alpha_{h0}) \quad \text{and} \quad D = D_h \sqrt{1 + \tan^2(\alpha_{h0} + \delta)} = D_h \sec(\alpha_{h0} + \delta). \tag{B-2}$$

Since the vertical field of view is represented by 

$$FOV = 2 D \tan(\sigma_v / 2) \tag{B-3}$$

the ratio of the two fields of view shown above is 

$$FOV\ ratio = \frac{FOV}{FOV_0} = \frac{2\,(D_h \sec(\alpha_{h0} + \delta)) \tan(\sigma_v / 2)}{2\,(D_h \sec(\alpha_{h0})) \tan(\sigma_v / 2)} \tag{B-4}$$

which simplifies to: 

$$FOV\ ratio = \frac{\sec(\alpha_{h0} + \delta)}{\sec(\alpha_{h0})} = \frac{\cos(\alpha_{h0})}{\cos(\alpha_{h0} + \delta)} \tag{B-5}$$

For the purpose of this derivation, $\sigma_h$ has been chosen 
as the horizontal angle of equivalence, so that the angular 
separation equates on the screen to: 

$$\delta = \frac{C - C_0}{360}\,\sigma_h \tag{B-6}$$

The field of view ratio now becomes the Pixel Ratio by 
substitution for $\delta$ in the FOV ratio equation (B-5). 

$$PR_h = \text{PIXEL RATIO} = \frac{\cos(\alpha_{h0})}{\cos\left(\alpha_{h0} + \dfrac{C - C_0}{360}\,\sigma_h\right)}$$
The h subscript indicates that this pixel ratio adjusts 
for the distortion due to the differing distance in the 
horizontal direction. 
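
As a numerical illustration of the result, the Python sketch below 
evaluates the pixel ratio from equations (B-5) and (B-6) and checks 
that the offset form of the distance (B-1) agrees with the secant form 
(B-2). The function and parameter names are hypothetical. Treating the 
constant 360 in (B-6) as the number of screen columns is an assumption 
made here, not something the derivation states. 

```python
import math


def pixel_ratio_h(alpha_h0_deg, sigma_h_deg, column, ref_column, screen_columns=360):
    """Horizontal pixel ratio: cos(alpha) / cos(alpha + delta)."""
    # (B-6): angular separation of the column from the reference column.
    # screen_columns=360 mirrors the constant in (B-6); its meaning is assumed.
    delta_deg = (column - ref_column) / screen_columns * sigma_h_deg
    alpha = math.radians(alpha_h0_deg)
    # (B-5) with delta substituted.
    return math.cos(alpha) / math.cos(alpha + math.radians(delta_deg))


def distance_by_offsets(d_h, alpha_h0_deg, delta_deg):
    """Distance D from (B-1), built from the offsets b and db."""
    alpha = math.radians(alpha_h0_deg)
    delta = math.radians(delta_deg)
    b = d_h * math.tan(alpha)
    db = d_h * (math.tan(alpha + delta) - math.tan(alpha))
    return math.hypot(d_h, b + db)


# Hypothetical geometry: camera 20 degrees off perpendicular, 44 degree FOV,
# point of interest 180 columns to one side of the reference column.
print(pixel_ratio_h(20, 44, 280, 100))
print(distance_by_offsets(100.0, 20, 22), 100.0 / math.cos(math.radians(42)))
```

The two distances printed on the last line should agree to within 
rounding, which is the substitution step that leads from (B-1) to 
(B-2). 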
100 '**************************************************************
110 '**                                                          **
120 '**                                                          **
130 '**       PROGRAM TO DETERMINE MAXIMUM AND MINIMUM PRV       **
140 '**                                                          **
150 '**                                                          **
160 '**************************************************************
170 PI = 4*ATN(1)
180 PRINT "ALPHAV", "SIGMAV", "PRSUB1", "PRSUB2", "PRSUB3", "PRSUB4"
190 PRINT
200 PRMAX = 0
210 PRMIN = 100
220 '45 DEGREES IS THE MAXIMUM VERTICAL ANGLE CONSIDERED
230 FOR ALPHAV = 0 TO 45 STEP 5
240   ALPHAVEE = ALPHAV * PI/180
250   PRINT ALPHAV
260   'CAMERA USED FOR RESEARCH HAS VERTICAL FOV OF 5-33 DEGREES
270   FOR SIGMAV = 5 TO 33 STEP 4
280     SIGMAVEE = SIGMAV * PI/180
290     'CASE 1: REFERENCE AT TOP OF SCREEN, POINT OF INTEREST AT BOTTOM.
300     PRSUB1 = COS(ALPHAVEE + SIGMAVEE*.5)/COS(ALPHAVEE + SIGMAVEE*(-.5))
310     'CASE 2: REFERENCE AT TOP OF SCREEN, POINT OF INTEREST AT MIDDLE.
320     PRSUB2 = COS(ALPHAVEE + SIGMAVEE*.5)/COS(ALPHAVEE)
330     'CASE 3: REFERENCE AT MIDDLE OF SCREEN, POINT OF INTEREST AT TOP.
340     PRSUB3 = COS(ALPHAVEE)/COS(ALPHAVEE + SIGMAVEE * .5)
350     'CASE 4: REFERENCE AT MIDDLE OF SCREEN, POINT OF INTEREST AT BOTTOM.
360     PRSUB4 = COS(ALPHAVEE)/COS(ALPHAVEE + SIGMAVEE *(-.5))
370     PRINT SPC(8);SIGMAV, PRSUB1, PRSUB2, PRSUB3, PRSUB4
380     IF PRMAX < PRSUB1 THEN PRMAX = PRSUB1:MAX=SIGMAV:SUB=1
390     IF PRMAX < PRSUB2 THEN PRMAX = PRSUB2:MAX=SIGMAV:SUB=2
400     IF PRMAX < PRSUB3 THEN PRMAX = PRSUB3:MAX=SIGMAV:SUB=3
410     IF PRMAX < PRSUB4 THEN PRMAX = PRSUB4:MAX=SIGMAV:SUB=4
420     IF PRMIN > PRSUB1 THEN PRMIN = PRSUB1:MIN=SIGMAV:SU=1
430     IF PRMIN > PRSUB2 THEN PRMIN = PRSUB2:MIN=SIGMAV:SU=2
440     IF PRMIN > PRSUB3 THEN PRMIN = PRSUB3:MIN=SIGMAV:SU=3
450     IF PRMIN > PRSUB4 THEN PRMIN = PRSUB4:MIN=SIGMAV:SU=4
460   NEXT SIGMAV
470 NEXT ALPHAV
480 END


ALPHAV        SIGMAV        PRSUB1        PRSUB2        PRSUB3        PRSUB4

[Program output: pixel ratios PRSUB1 through PRSUB4 tabulated for 
ALPHAV = 0 to 45 degrees in steps of 5 and SIGMAV = 5 to 33 degrees 
in steps of 4. The tabulated values are illegible in the source.] 





'**************************************************************
'**                                                          **
'**                                                          **
'**       PROGRAM TO DETERMINE MAXIMUM AND MINIMUM PRH       **
'**                                                          **
'**                                                          **
'**************************************************************
PI = 4*ATN(1)
PRINT "ALPHAH", "SIGMAH", "PRSUB1", "PRSUB2", "PRSUB3", "PRSUB4"
PRINT
PRMAX = 0
PRMIN = 100
'45 DEGREES OFF CENTER WOULD BE THE MAXIMUM CONSIDERED
FOR ALPHAH = 0 TO 45 STEP 5
  ALPHAHEE = ALPHAH * PI/180
  PRINT ALPHAH
  'THE CAMERA USED FOR THIS RESEARCH HAD HORIZ FOV OF 8-44 DEGREES
  FOR SIGMAH = 8 TO 44 STEP 4
    SIGMAHEE = SIGMAH * PI/180
    'CASE 1: REFERENCE AT FAR RIGHT, POINT OF INTEREST AT FAR LEFT.
    PRSUB1 = COS(ALPHAHEE + SIGMAHEE*.5)/COS(ALPHAHEE + SIGMAHEE*(-.5))
    'CASE 2: REFERENCE AT FAR RIGHT, POINT OF INTEREST AT CENTER.
    PRSUB2 = COS(ALPHAHEE + SIGMAHEE*.5)/COS(ALPHAHEE)
    'CASE 3: REFERENCE AT CENTER, POINT OF INTEREST AT FAR RIGHT.
    PRSUB3 = COS(ALPHAHEE)/COS(ALPHAHEE + SIGMAHEE * .5)
    'CASE 4: REFERENCE AT CENTER, POINT OF INTEREST AT FAR LEFT.
    PRSUB4 = COS(ALPHAHEE)/COS(ALPHAHEE + SIGMAHEE *(-.5))
    PRINT SPC(8);SIGMAH, PRSUB1, PRSUB2, PRSUB3, PRSUB4
    IF PRMAX < PRSUB1 THEN PRMAX = PRSUB1:MAX=SIGMAH:SUB=1
    IF PRMAX < PRSUB2 THEN PRMAX = PRSUB2:MAX=SIGMAH:SUB=2
    IF PRMAX < PRSUB3 THEN PRMAX = PRSUB3:MAX=SIGMAH:SUB=3
    IF PRMAX < PRSUB4 THEN PRMAX = PRSUB4:MAX=SIGMAH:SUB=4
    IF PRMIN > PRSUB1 THEN PRMIN = PRSUB1:MIN=SIGMAH:SU=1
    IF PRMIN > PRSUB2 THEN PRMIN = PRSUB2:MIN=SIGMAH:SU=2
    IF PRMIN > PRSUB3 THEN PRMIN = PRSUB3:MIN=SIGMAH:SU=3
    IF PRMIN > PRSUB4 THEN PRMIN = PRSUB4:MIN=SIGMAH:SU=4
  NEXT SIGMAH
  PRMAX=0
  PRMIN=100
  MAX=0
  MIN=0
  SUB=0
  SU=0
NEXT ALPHAH
END
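
For readers without a BASIC interpreter, the following Python sketch 
performs the same four-case sweep and also reports the largest and 
smallest ratios it finds. It is an illustrative port, not the program 
used in this research: the function names are hypothetical, and it 
searches over the whole range at once rather than resetting the 
extremes for each camera angle as the listing does. 

```python
import math


def pixel_ratio_cases(alpha_deg, sigma_deg):
    """Return the four pixel ratios (cases 1 to 4) for one camera angle and FOV."""
    a = math.radians(alpha_deg)
    half = math.radians(sigma_deg) / 2.0
    return (
        math.cos(a + half) / math.cos(a - half),  # case 1: reference and point at opposite edges
        math.cos(a + half) / math.cos(a),         # case 2: reference at edge, point at center
        math.cos(a) / math.cos(a + half),         # case 3: reference at center, point at +sigma/2 edge
        math.cos(a) / math.cos(a - half),         # case 4: reference at center, point at -sigma/2 edge
    )


def sweep(alphas_deg, sigmas_deg):
    """Return (largest, smallest) entries, each as (ratio, alpha, sigma, case)."""
    entries = [
        (ratio, alpha, sigma, case)
        for alpha in alphas_deg
        for sigma in sigmas_deg
        for case, ratio in enumerate(pixel_ratio_cases(alpha, sigma), start=1)
    ]
    by_ratio = lambda entry: entry[0]
    return max(entries, key=by_ratio), min(entries, key=by_ratio)


if __name__ == "__main__":
    # Same ranges as the PRH listing: 0-45 degrees off center, 8-44 degree FOV.
    largest, smallest = sweep(range(0, 50, 5), range(8, 48, 4))
    print("max ratio %.6f at alpha=%d sigma=%d case=%d" % largest)
    print("min ratio %.6f at alpha=%d sigma=%d case=%d" % smallest)
```

Calling `sweep([alpha], sigmas)` for a single angle gives the 
per-angle extremes that the listing tracks in PRMAX and PRMIN. 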